Abstract: The reconstruction problem for permutations on $n$
elements from their erroneous patterns, which are distorted by
transpositions, is presented in this paper. It is shown that for
any $n \geq 3$ an unknown permutation is uniquely reconstructible
from 4 distinct permutations at transposition distance at most
one from the unknown permutation. The transposition distance
between two permutations is defined as the least number of
transpositions needed to transform one into the other. The
proposed approach is based on the investigation of structural
properties of a corresponding Cayley graph. In the case of at most
two transposition errors it is shown that $\frac{3}{2}(n-2)(n+1)$ erroneous
patterns are required in order to reconstruct an unknown
permutation. Similar results are obtained for two particular cases
when permutations are distorted by given transpositions. These
results confirm some bounds for regular graphs which are also
presented in this paper.

I. Introduction 

Efficient reconstruction of arbitrary sequences was introduced 
and investigated by Levenshtein for combinatorial channels 
with errors of interest in coding theory such as substitutions, 
transpositions, deletions and insertions of symbols [1], [2]. 
Sequences are considered as elements of a vertex set $V$ of a
graph $\Gamma = (V, E)$ where an edge $\{x, y\} \in E$ is viewed as
the single error transforming $x$ into $y \in V$. One of the metric
problems which arises here is the problem of reconstructing an
unknown vertex $x \in V$ from a minimum number of vertices
in the metric ball $B_r(x)$ of radius $r$ centered at the vertex
$x \in V$. It is reduced to finding the value

$$N(\Gamma, r) = \max_{x, y \in V,\ x \neq y} |B_r(x) \cap B_r(y)| \qquad (1)$$

since $N(\Gamma, r) + 1$ is the least number of distinct vertices
in the ball $B_r(x)$ around the unknown vertex $x$ which are
sufficient to reconstruct x subject to the condition that at 
most r single errors have happened. As one can see, this 
problem is based on considering metric balls in a graph 
but it differs from traditional packing and covering problems 
in various ways. It is motivated by a transmission model 
where information is realized in the presence of noise without 
encoding or redundancy, and where the ability to reconstruct 
a message (vertex) uniquely depends on having a sufficiently 
large number of erroneous patterns of this message. 

The value (1) was studied for the Hamming and Johnson
graphs [2]. Both graphs are distance-regular and the first is
a Cayley graph. The problem of finding the value (1) is
much more complicated for graphs which are not distance-regular.
Cayley graphs of this kind arise for instance on the
symmetric group and the signed permutation group, when
the reconstruction of permutations and signed permutations
is considered for distortions by single reversal errors [3], [4].

In this paper we continue these investigations and consider 
the reconstruction problem for permutations distorted by single 
transposition errors which consist of swapping 1) any two 
elements of a permutation; 2) any two neighboring elements 
of a permutation; and 3) the first and any other element of a 
permutation. The corresponding graphs are the transposition 
Cayley graph, the bubble-sort Cayley graph and the star 
Cayley graph. They are regular but not distance-regular. We
investigate the combinatorial properties of these graphs and
present the values (1) when $r = 1, 2$ in each case. Some
bounds on $N(\Gamma, 1)$ and $N(\Gamma, 2)$ for regular graphs are also
considered. It is shown that the bubble-sort and star Cayley
graphs are examples for which these bounds are attained.

II. Definitions, notation, general results 

Let $G$ be a finite group and let $S$ be a set of generators of
$G$ such that the identity element $e$ of $G$ does not belong to $S$
and such that $S = S^{-1}$, where $S^{-1} = \{s^{-1} : s \in S\}$. In the
Cayley graph $\Gamma = \mathrm{Cay}(G, S) = (V, E)$ vertices correspond to
the elements of the group, i.e. $V = G$, and edges correspond to
multiplication on the right by generators, i.e. $E = \{\{g, gs\} :
g \in G, s \in S\}$. Denote by $d(x, y)$ the path distance between
the vertices $x$ and $y$ in $\Gamma$, and by $d(\Gamma) = \max\{d(x, y) :
x, y \in V\}$ the diameter of $\Gamma$. In other words, in a Cayley
graph the diameter is the maximum, over $g \in G$, of the length
of a shortest expression for $g$ as a product of generators. For
the vertex $x$ let $S_r(x) = \{y \in V : d(x, y) = r\}$ and $B_r(x) =
\{y \in V : d(x, y) \leq r\}$ be the sphere and the ball of radius
$r$ centered at $x$, respectively. The vertices $y \in B_r(x)$ are
$r$-neighbors of the vertex $x$.

As mentioned in the Introduction, the value (1) was
investigated initially for distance-regular graphs such as the
Hamming and Johnson graphs. Let us recall that a simple
connected graph $\Gamma$ is distance-regular if there are integers
$b_i, c_i$ for $i \geq 0$ such that for any two vertices $x$ and $y$
at distance $d(x, y) = i$ there are precisely $c_i$ neighbors of
$y$ in $S_{i-1}(x)$ and $b_i$ neighbors of $y$ in $S_{i+1}(x)$. Evidently
$\Gamma$ is regular of valency $k = b_0$, or $k$-regular. A $k$-regular
simple graph $\Gamma$ is strongly regular if there exist integers
$\lambda$ and $\mu$ such that every adjacent pair of vertices has $\lambda$
common neighbors, and every nonadjacent pair of vertices has
$\mu$ common neighbors.

The Hamming space $F_q^n$ consists of the vectors of length
$n$ over the alphabet $\{0, 1, \ldots, q-1\}$, $q \geq 2$. It is endowed
with the Hamming distance $d$ where $d(x, y)$ is the number of
coordinate positions in which $x$ and $y$ differ. It can be viewed
as a graph $L_n(q)$ with vertex set given by the vector space
$F_q^n$ (where $F_q$ is the field of $q$ elements) where $\{x, y\}$ is an
edge of $L_n(q)$ iff $d(x, y) = 1$. This Hamming graph is the
Cayley graph on the additive group $F_q^n$ when we take the
generator set $S = \{x e_i : x \in (F_q)^{\times}, 1 \leq i \leq n\}$ where the
$e_i = (0, \ldots, 0, 1, 0, \ldots, 0)$ are the standard basis vectors of $F_q^n$.
It was shown in [1], [2] that for any $n \geq 2$, $q \geq 2$ and $r \geq 1$,

$$N(L_n(q), r) = q \sum_{i=0}^{r-1} \binom{n-1}{i} (q-1)^i. \qquad (2)$$

For the particular case $n = 2$ the Hamming graph $L_2(q)$ is
the lattice graph over $F_q$. This graph is strongly regular with
parameters $v = q^2$, $k = 2(q-1)$, $\lambda = q-2$, $\mu = 2$, and from
(2) we get $N(L_2(q), 1) = q$ and $N(L_2(q), 2) = q^2$.
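As a sanity check of (2), the following Python sketch compares the closed form with a brute-force evaluation of definition (1) on a few small Hamming graphs. The function names are illustrative and not taken from [1], [2]; the expression coded in `hamming_formula` is $q\sum_{i=0}^{r-1}\binom{n-1}{i}(q-1)^i$, which is assumed here to be the reading of (2).

```python
from itertools import product
from math import comb

def hamming_formula(n, q, r):
    # Closed form assumed for (2): q * sum_{i<r} C(n-1, i) * (q-1)^i
    return q * sum(comb(n - 1, i) * (q - 1) ** i for i in range(r))

def hamming_distance(x, y):
    return sum(a != b for a, b in zip(x, y))

def hamming_bruteforce(n, q, r):
    # Largest intersection of two radius-r balls with distinct centers.
    # The graph is vertex-transitive, so one center is fixed at the zero vector.
    vertices = list(product(range(q), repeat=n))
    zero = vertices[0]
    ball0 = [v for v in vertices if hamming_distance(zero, v) <= r]
    best = 0
    for y in vertices[1:]:
        common = sum(1 for v in ball0 if hamming_distance(y, v) <= r)
        best = max(best, common)
    return best

for n, q, r in [(2, 3, 1), (2, 3, 2), (3, 2, 2), (3, 3, 2)]:
    print((n, q, r), hamming_formula(n, q, r), hamming_bruteforce(n, q, r))
```

For $n = 2$ the closed form reduces to $q$ for $r = 1$ and to $q^2$ for $r = 2$, in agreement with the lattice graph values quoted above.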

The Johnson graph $J_e^n$ is defined on the subset $V = J_e^n \subseteq
F_2^n$ consisting of all $\{0, 1\}$-vectors with exactly $e$ entries equal
to 1, for a fixed $1 \leq e \leq n-1$. On $J_e^n$ the Johnson distance is
defined as half the (even) Hamming distance, and two vertices
$x, y$ are joined by an edge iff they are at Johnson distance 1
from each other. In general $J_e^n$ is not a Cayley graph although
the notion of errors being represented by edges makes sense
all the same. In particular, two vertices are at distance 1 from
each other iff one is obtained from the other by the interchange
of two coordinate positions. In [1], [2] it was shown that

$$N(J_e^n, r) = n \sum_{i=0}^{r-1} \binom{e-1}{i} \binom{n-e-1}{i} \frac{1}{i+1} \qquad (3)$$

for any $n \geq 2$, $e \geq 1$ and $r \geq 1$. In the particular case $e = 2$
and $n \geq 4$ the Johnson graph $J_2^n$ is the triangular graph $T(n)$.
As vertices it has the 2-element subsets of an $n$-set and two
vertices are adjacent iff they are not disjoint. This graph is
strongly regular with parameters $v = \frac{n(n-1)}{2}$, $k = 2(n-2)$,
$\lambda = n-2$, $\mu = 4$, and from (3) we obtain $N(T(n), 1) = n$
and $N(T(n), 2) = \frac{n(n-1)}{2}$.

These two results were the first analytic formulas for the 
reconstruction problem we are interested in. Their uniformity 
depends on the fact that these graphs are distance-regular. 
What then are the general results for simple graphs, regular 
graphs and Cayley graphs? We start with a few observations 
from [5] for any connected simple graph $\Gamma = (V, E)$. In the
spirit of distance regularity we put $k_i(x) = |S_i(x)|$ and define
numbers $c_i(x, y)$, $b_i(x, y)$ and $a_i(x, y)$ for any two vertices
$x \in V$ and $y \in S_i(x)$ such that

$$c_i(x, y) = |\{z \in S_{i-1}(x) : d(z, y) = 1\}|,$$

$$b_i(x, y) = |\{z \in S_{i+1}(x) : d(z, y) = 1\}|,$$

$$a_i(x, y) = |\{z \in S_i(x) : d(z, y) = 1\}|.$$

From this $a_1(x, y) = a_1(y, x)$ is the number of triangles
over the edge $\{x, y\}$ and $c_2(x, y)$ is the number of common
neighbors of $x \in V$ and $y \in S_2(x)$. Let

$$\lambda = \lambda(\Gamma) = \max_{x \in V,\ y \in S_1(x)} a_1(x, y), \qquad (4)$$

$$\mu = \mu(\Gamma) = \max_{x \in V,\ y \in S_2(x)} c_2(x, y). \qquad (5)$$

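Since $|B_r(x) \cap B_r(y)| > 0$ for $x \neq y$ if and only if
$1 \leq d(x, y) \leq 2r \leq d(\Gamma)$, we have

$$N(\Gamma, r) = \max_{1 \leq s \leq 2r} N_s(\Gamma, r) \qquad (6)$$

where $N_s(\Gamma, r) = \max\{|B_r(x) \cap B_r(y)| : x, y \in V,\ d(x, y) = s\}$.
In particular, $N_1(\Gamma, 1) = \lambda + 2$ and $N_2(\Gamma, 1) = \mu$ so that

$$N(\Gamma, 1) = \max(\lambda + 2, \mu). \qquad (7)$$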

One can easily check that using this formula for the lattice
graph $L_2(q)$ and the triangular graph $T(n)$ we obtain again
the earlier values from (2) and (3). Indeed, since $\lambda = n-2$ and
$\mu = 4$ for $T(n)$, $n \geq 4$, we have $N(T(n), 1) = n$ from (7).
By the same reason we have $N(L_2(q), 1) = q$ since $\lambda = q-2$
and $\mu = 2$ in this case.
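The check described above can also be carried out mechanically. The Python sketch below builds the lattice graph $L_2(q)$ and the triangular graph $T(n)$ as adjacency dictionaries, computes $\lambda$ and $\mu$ by counting common neighbors, and compares $\max(\lambda + 2, \mu)$ with a direct evaluation of definition (1) for $r = 1$. All function names are illustrative.

```python
from itertools import combinations, product

def lattice_graph(q):
    # L_2(q): pairs over {0..q-1}, adjacent iff they differ in exactly one coordinate.
    vs = list(product(range(q), repeat=2))
    return {u: {v for v in vs if sum(a != b for a, b in zip(u, v)) == 1} for u in vs}

def triangular_graph(n):
    # T(n): 2-subsets of an n-set; distinct subsets are adjacent iff they intersect.
    vs = [frozenset(c) for c in combinations(range(n), 2)]
    return {u: {v for v in vs if len(u & v) == 1} for u in vs}

def lam_mu(adj):
    # lambda: max number of common neighbours over adjacent pairs;
    # mu: max number of common neighbours over pairs at distance exactly 2.
    lam = mu = 0
    for x, y in combinations(adj, 2):
        common = len(adj[x] & adj[y])
        if y in adj[x]:
            lam = max(lam, common)
        elif common > 0:
            mu = max(mu, common)
    return lam, mu

def n1_bruteforce(adj):
    # N(Gamma, 1) straight from definition (1), using closed balls of radius 1.
    balls = {x: adj[x] | {x} for x in adj}
    return max(len(balls[x] & balls[y]) for x, y in combinations(adj, 2))

for name, graph in [("L_2(4)", lattice_graph(4)), ("T(5)", triangular_graph(5))]:
    lam, mu = lam_mu(graph)
    print(name, lam, mu, max(lam + 2, mu), n1_bruteforce(graph))
```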

We have no general results for $N(\Gamma, r)$ when $\Gamma$ is a regular
graph. The numbers $c_i(x, y)$ and $b_i(x, y)$ usually depend
on $y \in S_i(x)$ and this causes difficulties when searching
for general estimates of $N(\Gamma, r)$. However, some bounds on
$N(\Gamma, 1)$ and $N(\Gamma, 2)$ were obtained in [5]. Here it is assumed
that $\Gamma$ is connected, $k$-regular of diameter $d(\Gamma) \geq 2$ with $v \geq 4$
vertices and parameters $0 \leq \lambda \leq k-2$, $1 \leq \mu \leq k$, where
$2 \leq k \leq v-2$.

Theorem 1: For any $k$-regular graph $\Gamma$,

$$N(\Gamma, 1) \leq \frac{1}{2}(v + \lambda). \qquad (8)$$

This theorem is proved by checking that $\lambda + 2 \leq \frac{1}{2}(v + \lambda)$
and $\mu \leq \frac{1}{2}(v + \lambda)$. The first inequality holds since $k \leq
v - 2$ and $\lambda \leq k - 2$. Moreover, there is equality only if
$\lambda = v - 4$ and $k = v - 2$. The second inequality is true
since counting edges between $S_1(x)$ and $S_2(x)$ for any $x \in V$
we have $\sum_{y \in S_1(x)} (k - 1 - a_1(x, y)) = \sum_{z \in S_2(x)} c_2(x, z)$.
From (4), (5) and the fact that $k_2(x) \leq v - k - 1$ we get
$k(k - 1 - \lambda) \leq \mu k_2(x) \leq \mu(v - k - 1)$ with equality if and
only if $\Gamma$ is strongly regular. Let us note here that the equality
$k(k - 1 - \lambda) = \mu(v - k - 1)$ is well-known for strongly
regular graphs. From this and the fact that $1 \leq \mu \leq k$ we
have $k - 1 - \lambda \leq v - k - 1$ and hence $\mu \leq k \leq \frac{1}{2}(v + \lambda)$
is valid for any regular graph $\Gamma$. By taking into account these
two inequalities for $\lambda$ and $\mu$ we get (8) from (7). Moreover, (8)
is attained on the strongly regular complete $t$-partite graph with
$t(k - \lambda)$ vertices partitioned into $t \geq 2$ parts of equal size,
where $t = \frac{2k - \lambda}{k - \lambda}$
is an integer, and with edges connecting any two vertices of
different parts.
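The extremal example can be checked numerically. The Python sketch below builds the complete multipartite graph with $t$ parts of size $m$ (so that $k = (t-1)m$, $\lambda = (t-2)m$ and $m = k - \lambda$), computes $N(\Gamma, 1)$ from definition (1), and compares it with the right-hand side of (8). The helper names are illustrative, and taking all parts of equal size $m \geq 2$ is an assumption made so that the graph is regular of diameter 2.

```python
from itertools import combinations

def complete_multipartite(t, m):
    # t parts of size m; a vertex is (part, index); edges join different parts.
    vs = [(p, i) for p in range(t) for i in range(m)]
    return {u: {v for v in vs if v[0] != u[0]} for u in vs}

def check_bound(t, m):
    adj = complete_multipartite(t, m)
    v = len(adj)
    # lambda: common neighbours of adjacent pairs (constant for this graph).
    lam = max(len(adj[x] & adj[y]) for x in adj for y in adj[x])
    balls = {x: adj[x] | {x} for x in adj}
    n1 = max(len(balls[x] & balls[y]) for x, y in combinations(adj, 2))
    return n1, (v + lam) / 2

for t, m in [(2, 3), (3, 2), (4, 3)]:
    print((t, m), check_bound(t, m))
```

In each of these cases the two returned numbers coincide, which is what attaining (8) means.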



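Theorem 2: For any $k$-regular graph $\Gamma$ we have

$$N_2(\Gamma, 2) \geq \mu\left(k - 1 - \frac{1}{2}(\mu - 1)(N(\Gamma, 1) - 2)\right) + 2. \qquad (9)$$

In proving (9) the linear programming problem arises for the
vertex subset $U = \bigcup_{i=1}^{\mu} B_1(z_i) \setminus \{x, y\}$, where $x, y \in V$ with
$d(x, y) = 2$ and $z_i$, $i = 1, \ldots, \mu$, are the vertices at distance 1
from both $x$ and $y$. The task is to minimize $|U| = \sum_{h=1}^{\mu} \omega_h$ for
nonnegative numbers $\omega_h$ satisfying the following conditions

$$\sum_{h=1}^{\mu} h\,\omega_h = \mu(k - 1),$$

$$\sum_{h=1}^{\mu} \binom{h}{2}\omega_h \leq \binom{\mu}{2}(N(\Gamma, 1) - 2),$$

where $\omega_h = |U(h)|$, and $U(h)$ is the set of vertices in $U$
belonging to exactly $h$ sets $B_1(z_i)$, $i = 1, \ldots, \mu$.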

The details of the proofs for Theorems 1 and 2 can be 
found in [5]. From the last theorem one can immediately get 
the following corollaries. 

Corollary 1: For a $k$-regular graph $\Gamma$,

(i) if $\mu = 1$, then $N_2(\Gamma, 2) \geq k + 1$;

(ii) if $\mu = 2$ and $N(\Gamma, 1) = 2$, then $N_2(\Gamma, 2) \geq 2k$;

(iii) if $\mu = 3$ and $N(\Gamma, 1) = 3$, then $N_2(\Gamma, 2) \geq 3k - 5$.

Corollary 2: Let $\Gamma$ be a $k$-regular graph without triangles
or pentagons, with $\mu \geq 2$ and $k \geq 1 + \frac{1}{2}(\mu - 1)\mu$. Then

$$N_2(\Gamma, 2) \geq N_1(\Gamma, 2). \qquad (10)$$

Actually, since $\Gamma$ does not contain triangles or pentagons
we have $N_1(\Gamma, 2) = 2k$ and $N(\Gamma, 1) = \mu$ by (7) since $\lambda = 0$
and $\mu \geq 2$. Using (9) we get

$$N_2(\Gamma, 2) - 2k \geq (\mu - 2)\left(k - 1 - \frac{1}{2}(\mu - 1)\mu\right) \geq 0,$$

and finally we obtain (10).
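The algebra behind the last inequality can be spelled out as follows. This is a sketch that assumes (9) has the form $N_2(\Gamma, 2) \geq \mu\bigl(k - 1 - \frac{1}{2}(\mu - 1)(N(\Gamma, 1) - 2)\bigr) + 2$ and substitutes $N(\Gamma, 1) = \mu$:

```latex
\begin{align*}
N_2(\Gamma,2) - 2k
  &\ge \mu\Bigl(k - 1 - \tfrac12(\mu-1)(\mu-2)\Bigr) + 2 - 2k \\
  &= (\mu-2)k - (\mu-2) - \tfrac12\,\mu(\mu-1)(\mu-2) \\
  &= (\mu-2)\Bigl(k - 1 - \tfrac12(\mu-1)\mu\Bigr).
\end{align*}
```

Both factors in the last line are nonnegative: the first because $\mu \geq 2$, the second because $k \geq 1 + \frac{1}{2}(\mu - 1)\mu$.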

In the remainder of this section it is assumed that $\Gamma =
\mathrm{Cay}(G, S)$ is a Cayley graph on the group $G$ for the generator
set $S$. Let us put $S^0 = \{e\}$ and set $S^i = S S^{i-1}$. Moreover, by
vertex-transitivity it is sufficient to consider only the spheres
and balls with center $e$ so that $S_i = S_i(e)$.

Lemma 1: For any Cayley graph $\Gamma$ on the group $G$ and for
$i > 0$ we have $S_i = S^i \setminus (S^{i-1} \cup S^{i-2} \cup \ldots \cup S^0)$. In particular,
$\mu$ is the maximum number of representations of an element in
$S^2 \setminus (S \cup S^0)$ as a product of two elements of $S$ and $\lambda$ is the
maximum number of representations of an element in $S$ as a
product of two elements of $S$, i.e.

$$\lambda(\Gamma) = \max_{s \in S} |\{(s_i, s_j) \in S \times S : s = s_i s_j\}|,$$

$$\mu(\Gamma) = \max_{s \in S^2 \setminus (S \cup S^0)} |\{(s_i, s_j) \in S \times S : s = s_i s_j\}|.$$

This lemma allows us to find $N(\Gamma, 1)$ from (7) for a general
Cayley graph. The results for estimating the values $N(\Gamma, r)$
for small $r$ in Cayley graphs on the symmetric group $\mathrm{Sym}_n$
will be presented in the next section when the generator set $S$
consists of transpositions.
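Lemma 1 turns the computation of $\lambda$ and $\mu$ into counting factorizations $s = s_i s_j$. The Python sketch below does this for three generator sets on the symmetric group (all transpositions, adjacent transpositions, and transpositions involving the first position). Permutations are 0-indexed tuples in one-line notation; all names are illustrative.

```python
from itertools import combinations
from collections import Counter

def compose(p, q):
    # Right multiplication in one-line notation: (p * q)[i] = p[q[i]],
    # so multiplying by a transposition swaps two positions of p.
    return tuple(p[i] for i in q)

def transposition(n, i, j):
    t = list(range(n))
    t[i], t[j] = t[j], t[i]
    return tuple(t)

def generator_sets(n):
    return {
        "transposition": [transposition(n, i, j) for i, j in combinations(range(n), 2)],
        "bubble-sort": [transposition(n, i, i + 1) for i in range(n - 1)],
        "star": [transposition(n, 0, i) for i in range(1, n)],
    }

def lam_mu_from_products(n, gens):
    # Count, for every product of two generators, how many ordered pairs give it.
    e = tuple(range(n))
    S = set(gens)
    counts = Counter(compose(a, b) for a in gens for b in gens)
    lam = max((c for g, c in counts.items() if g in S), default=0)
    mu = max((c for g, c in counts.items() if g not in S and g != e), default=0)
    return lam, mu

for name, gens in generator_sets(4).items():
    print(name, lam_mu_from_products(4, gens))
```

By (7), the pair $(\lambda, \mu)$ returned for each generator set gives $N(\Gamma, 1) = \max(\lambda + 2, \mu)$ directly.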



III. The reconstruction of permutations in Cayley graphs generated by transpositions

Let $\mathrm{Sym}_n$ be the symmetric group on $n$ symbols. We write
a permutation $\pi$ in one-line notation as $\pi = [\pi_1, \pi_2, \ldots, \pi_n]$
where $\pi_i = \pi(i)$ for every $i \in \{1, \ldots, n\}$.

For the transposition Cayley graph $\mathrm{Sym}_n(T)$ on $\mathrm{Sym}_n$
the generator set consists of all transpositions $T = \{t_{i,j} \in
\mathrm{Sym}_n,\ 1 \leq i < j \leq n\}$, $|T| = \binom{n}{2}$, where $t_{i,j}$
interchanges positions $i$ and $j$ when multiplied on the right, i.e.,
$[\ldots, \pi_i, \ldots, \pi_j, \ldots] \cdot t_{i,j} = [\ldots, \pi_j, \ldots, \pi_i, \ldots]$. For $x, y \in
\mathrm{Sym}_n$ the distance $d(x, y)$ is the least number of transpositions
$t_1, \ldots, t_r$ such that $x \cdot t_1 \cdots t_r = y$, or $t_1 \cdots t_r = x^{-1} \cdot y$. As any
$k$-cycle can be written as a product of $k - 1$ transpositions (but
no fewer), the diameter of $\mathrm{Sym}_n(T)$ is $n - 1$. The graph
is bipartite since any edge joins an even permutation to an
odd permutation. The symmetry properties of $\mathrm{Sym}_n(T)$ have
been discussed in [6]. The graph is edge-transitive but not
distance-regular and hence not distance-transitive. All these
properties and other basic facts are collected in the following
statements.

Lemma 2: The transposition graph $\mathrm{Sym}_n(T)$, $n \geq 3$,

(i) is a connected bipartite $\binom{n}{2}$-regular graph of order $n!$ and
diameter $n - 1$;

(ii) is not distance-regular and hence not distance-transitive;

(iii) does not contain subgraphs isomorphic to $K_{2,4}$, and
each of its vertices belongs to $\binom{n}{3}$ subgraphs isomorphic to
$K_{3,3}$.

(Here $K_{p,q}$ is the complete bipartite graph with $p$ and $q$
vertices in the two parts, respectively.)

Theorem 3: For any $n \geq 3$ we have $N(\mathrm{Sym}_n(T), 1) = 3$.

This means that any unknown permutation is uniquely
reconstructible from 4 distinct permutations at transposition distance
at most one from the unknown permutation. The proof of these
statements is based on considering a permutation $\pi \in \mathrm{Sym}_n$ in
cycle notation, with cycle type $ct(\pi) = 1^{h_1} 2^{h_2} \ldots n^{h_n}$, where $h_i$
is the number of cycles of length $i$. In particular $\sum_{i=1}^{n} i h_i = n$.
The permutation $\pi$ can be also presented as a product of a
least number of transpositions. Each such product represents
a shortest path in $\mathrm{Sym}_n(T)$ from $e$ to $\pi$. The number of
such paths was obtained in [7]. This result is based on Ore's
theorem on the number of trees with $n$ labeled vertices and
is presented in the following theorem.

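Theorem 4: [7] Let $\pi \in \mathrm{Sym}_n$ have cycle type $ct(\pi) =
1^{h_1} 2^{h_2} \ldots n^{h_n}$, consisting of $\sum_{j=1}^{n} h_j = n - i$ cycles where
$1 \leq i \leq n - 1$. Then the number of distinct ways to express
$\pi$ as a product of $i$ transpositions is equal to

$$i! \prod_{j=1}^{n} \left(\frac{j^{j-2}}{(j-1)!}\right)^{h_j}.$$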
According to the above theorem, the following lemma gives us
formulas for the numbers $c_i(\pi) := c_i(\pi, e)$, $b_i(\pi) := b_i(\pi, e)$,
$a_i(\pi) := a_i(\pi, e)$, $1 \leq i \leq n - 1$, where $e$ is the identity element
of the transposition Cayley graph $\mathrm{Sym}_n(T)$.



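Lemma 3: In the transposition graph $\mathrm{Sym}_n(T)$ the sets
$S_i = S_i(e)$, $1 \leq i \leq n - 1$, are the permutations consisting of
$(n - i)$ disjoint cycles, counting also 1-cycles. For any $\pi \in S_i$
with cycle type $ct(\pi) = 1^{h_1} 2^{h_2} \ldots n^{h_n}$, we have $a_i(\pi) = 0$
and

$$c_i(\pi) = \sum_{j=2}^{n} h_j \binom{j}{2}, \qquad b_i(\pi) = \binom{n}{2} - c_i(\pi).$$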
In particular, since $a_i(\pi) = 0$ for any $1 \leq i \leq n - 1$, from
this lemma and by (4) we have $\lambda(\mathrm{Sym}_n(T)) = 0$. Moreover,
it is well-known that two permutations are conjugate by an
element of $\mathrm{Sym}_n$ if and only if they have the same cycle
type. If $(1^{h_1} 2^{h_2} \ldots n^{h_n})^G$ denotes the conjugacy class of an
element of cycle type $1^{h_1} 2^{h_2} \ldots n^{h_n}$, then it is shown in [5]
that $S_i$, $1 \leq i \leq n - 1$, is the disjoint union

$$S_i = \bigcup_{h_1 + h_2 + \ldots + h_n = n - i} (1^{h_1} 2^{h_2} \ldots n^{h_n})^G, \qquad (11)$$

where

$$|(1^{h_1} 2^{h_2} \ldots n^{h_n})^G| = \frac{n!}{1^{h_1} h_1!\, 2^{h_2} h_2! \cdots n^{h_n} h_n!}. \qquad (12)$$

Hence, from (11) we have $S_2 = (1^{n-3} 3^1)^G \cup (1^{n-4} 2^2)^G$
and then by Lemma 3 we get $c_2(\pi) = 3$ if $ct(\pi) = 1^{n-3} 3^1$,
and $c_2(\pi) = 2$ if $ct(\pi) = 1^{n-4} 2^2$. From these and (5)
we have $\mu(\mathrm{Sym}_n(T)) = 3$, and therefore, by (7) we get
Theorem 3. Moreover, there are no subgraphs isomorphic to
$K_{2,4}$ in $\mathrm{Sym}_n(T)$ since $\mu(\mathrm{Sym}_n(T)) = 3$. The number $\binom{n}{3}$
of subgraphs isomorphic to $K_{3,3}$ and having $e$ as one of its
vertices is obtained from (12) for any $\pi \in (1^{n-3} 3^1)^G$. By
vertex-transitivity the same holds for any vertex in $\mathrm{Sym}_n(T)$
(see condition (iii) in Lemma 2).
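The decomposition (11) together with the class sizes (12) determines the sphere sizes $|S_i|$. The Python sketch below sums the class sizes over all cycle types with $n - i$ cycles and compares the result with sphere sizes obtained by breadth-first search in the transposition graph. Function names are illustrative; the class size used is $n! / \prod_j j^{h_j} h_j!$.

```python
from collections import Counter, deque
from itertools import combinations
from math import factorial

def partitions(n, largest=None):
    # Yields the partitions of n as non-increasing tuples of cycle lengths.
    if largest is None:
        largest = n
    if n == 0:
        yield ()
        return
    for part in range(min(n, largest), 0, -1):
        for rest in partitions(n - part, part):
            yield (part,) + rest

def class_size(n, parts):
    # Size of the conjugacy class with the given cycle lengths: n! / prod_j (j^h_j * h_j!).
    denom = 1
    for j, hj in Counter(parts).items():
        denom *= j ** hj * factorial(hj)
    return factorial(n) // denom

def sphere_sizes_from_classes(n):
    # A cycle type with n - i cycles contributes to the sphere S_i.
    sizes = Counter()
    for parts in partitions(n):
        sizes[n - len(parts)] += class_size(n, parts)
    return [sizes[i] for i in range(n)]

def sphere_sizes_bfs(n):
    # Distances from the identity when every transposition is a generator.
    e = tuple(range(n))
    swaps = list(combinations(range(n), 2))
    dist = {e: 0}
    queue = deque([e])
    while queue:
        p = queue.popleft()
        for i, j in swaps:
            q = list(p)
            q[i], q[j] = q[j], q[i]
            q = tuple(q)
            if q not in dist:
                dist[q] = dist[p] + 1
                queue.append(q)
    sizes = Counter(dist.values())
    return [sizes[i] for i in range(n)]

for n in (4, 5):
    print(n, sphere_sizes_from_classes(n), sphere_sizes_bfs(n))
```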

So, any unknown permutation is uniquely reconstructible
from 4 distinct permutations at transposition distance at most
1 from the unknown permutation. As the following shows, in
the case of at most two transposition errors the reconstruction
of the permutation $\pi$ requires many more distinct 2-neighbors
of $\pi$.
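The reconstruction statement for single transposition errors can be illustrated directly. The Python sketch below recovers an unknown permutation from 4 distinct patterns in its radius-1 ball by intersecting the radius-1 balls around the patterns; this is a naive procedure chosen for illustration, not an algorithm from the paper, and the names `ball1` and `reconstruct` are hypothetical.

```python
import random
from itertools import combinations

def ball1(p):
    # Closed ball of radius 1 in the transposition graph: p and all single swaps of p.
    out = {p}
    for i, j in combinations(range(len(p)), 2):
        q = list(p)
        q[i], q[j] = q[j], q[i]
        out.add(tuple(q))
    return out

def reconstruct(patterns):
    # All centers x with every pattern in B_1(x). Since d(x, p) <= 1 is symmetric,
    # these are exactly the common elements of the balls around the patterns.
    candidates = None
    for p in patterns:
        b = ball1(p)
        candidates = b if candidates is None else candidates & b
    return candidates

rng = random.Random(0)
n = 6
secret = tuple(rng.sample(range(n), n))
patterns = rng.sample(sorted(ball1(secret)), 4)
print(reconstruct(patterns) == {secret})
```

Three patterns are not always enough: the three permutations obtained from the identity by swapping positions within $\{1, 2, 3\}$ are also common neighbors of both 3-cycles on these positions, so `reconstruct` returns three candidates for them.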



Theorem 5: For $n \geq 3$ we have

$$N(\mathrm{Sym}_n(T), 2) = \frac{3}{2}(n - 2)(n + 1). \qquad (13)$$



This follows from the fact that the normalizer of $T$ is $G =
\mathrm{Sym}_n$ itself and from the following lemma.

Lemma 4: For any $\pi \in S_i$, $1 \leq i \leq n - 1$, the number
of vertices in $(1^{h_1} 2^{h_2} \ldots n^{h_n})^G$ at a given distance from $\pi$
depends only on the conjugacy class to which $\pi$ belongs.

To prove Theorem 5 it is therefore sufficient to consider the
numbers of vertices in all subsets of $B_2(e)$ at minimal distance
at most 2 from a given vertex $\pi \in S_i$, $1 \leq i \leq 4$. By (11)
we have

$$S_1 = (1^{n-2} 2^1)^G, \qquad S_2 = (1^{n-3} 3^1)^G \cup (1^{n-4} 2^2)^G,$$

$$S_3 = (1^{n-4} 4^1)^G \cup (1^{n-5} 2^1 3^1)^G \cup (1^{n-6} 2^3)^G,$$

$$S_4 = (1^{n-5} 5^1)^G \cup (1^{n-6} 2^1 4^1)^G \cup (1^{n-6} 3^2)^G \cup (1^{n-7} 2^2 3^1)^G \cup (1^{n-8} 2^4)^G.$$

It is shown that $N_4(\mathrm{Sym}_n(T), 2) = 20$ for $n \geq 5$,
$N_3(\mathrm{Sym}_n(T), 2) = 12$ for $n \geq 4$, $N_2(\mathrm{Sym}_n(T), 2) =
\frac{3}{2}(n - 2)(n + 1)$ and $N_1(\mathrm{Sym}_n(T), 2) = n(n - 1)$ for all
$n \geq 3$. From these and by (6) one can conclude (13).
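For small $n$ the values in Theorems 3 and 5 can be confirmed by exhaustive search. The Python sketch below evaluates definition (1) in the transposition graph, fixing one center at the identity (which is legitimate by vertex-transitivity) and letting the other range over all vertices within distance $2r$. Function names are illustrative, and the search is only practical for small $n$.

```python
from collections import deque
from itertools import combinations

def ball(center, swaps, r):
    # Breadth-first search: all permutations within r position swaps of center.
    dist = {center: 0}
    queue = deque([center])
    while queue:
        p = queue.popleft()
        if dist[p] == r:
            continue
        for i, j in swaps:
            q = list(p)
            q[i], q[j] = q[j], q[i]
            q = tuple(q)
            if q not in dist:
                dist[q] = dist[p] + 1
                queue.append(q)
    return set(dist)

def max_ball_intersection(n, swaps, r):
    # N(Gamma, r): largest |B_r(e) & B_r(y)| over all y != e within distance 2r of e.
    e = tuple(range(n))
    ball_e = ball(e, swaps, r)
    others = ball(e, swaps, 2 * r) - {e}
    return max(len(ball_e & ball(y, swaps, r)) for y in others)

for n in (3, 4, 5):
    T = list(combinations(range(n), 2))
    n1 = max_ball_intersection(n, T, 1)
    n2 = max_ball_intersection(n, T, 2)
    print(n, n1, n2, 3 * (n - 2) * (n + 1) // 2)
```

For each $n$ the last two printed numbers should agree if (13) holds.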

The statements of Theorem 5 and Corollary 2 are
generalized in the following conjecture.

Conjecture 1: For any $\pi \in (1^{n-3} 3^1)^G$, for any $r \geq 1$ and
$n \geq 2r + 1$ we have

$$N(\mathrm{Sym}_n(T), r) = N_2(\mathrm{Sym}_n(T), r) = |B_r(e) \cap B_r(\pi)|.$$

Now let us consider the bubble-sort graph $\mathrm{Sym}_n(t)$. This
is the Cayley graph on $\mathrm{Sym}_n$ for the generator set $t =
\{t_{i,i+1} \in \mathrm{Sym}_n,\ 1 \leq i < n\}$, $|t| = n - 1$. These bubble-sort
transpositions are 2-cycles $t_{i,i+1}$ interchanging $i$ and $i + 1$ and
determine the graph distance in $\mathrm{Sym}_n(t)$ in the usual way. It
is known that the diameter of $\mathrm{Sym}_n(t)$ is $\binom{n}{2}$.

Lemma 5: The bubble-sort graph $\mathrm{Sym}_n(t)$, $n \geq 3$,

(i) is a connected bipartite $(n - 1)$-regular graph of order $n!$
and diameter $\binom{n}{2}$;

(ii) does not contain subgraphs isomorphic to $K_{2,3}$;

(iii) has each of its vertices in $\binom{n-2}{2}$, $n \geq 4$, subgraphs
isomorphic to $K_{2,2}$.

The symmetry properties of the bubble-sort graph were
discussed in [6] where it was shown that this graph is not
distance-regular. As it is bipartite there are no triangles and
hence $\lambda(\mathrm{Sym}_n(t)) = 0$. If an element $\pi \in S_2(e)$ has at
least two neighbors $t_{i,i+1} \neq t_{j,j+1}$ in $S_1(e)$ then necessarily
$t_{i,i+1} t_{j,j+1} = \pi = t_{j,j+1} t_{i,i+1}$ with $\{i, i+1\}$ and
$\{j, j+1\}$ disjoint. It suffices to verify this for permutations
on 4 letters. Hence there are at most two such neighbors and
so $\mu(\mathrm{Sym}_n(t)) = 2$. It can be also verified that we have
$N_4(\mathrm{Sym}_n(t), 2) = 4$ for $n \geq 5$, $N_3(\mathrm{Sym}_n(t), 2) = 2$ for
$n \geq 4$, $N_2(\mathrm{Sym}_n(t), 2) = N_1(\mathrm{Sym}_n(t), 2) = 2(n - 1)$ for
$n \geq 3$. From all these and by (6) and (7) we get the following
theorem.

Theorem 6: For any $n \geq 3$ we have

$$N(\mathrm{Sym}_n(t), 1) = 2 \quad \text{and} \quad N(\mathrm{Sym}_n(t), 2) = 2(n - 1).$$

Almost the same results appear for the star Cayley graph
$\mathrm{Sym}_n(st)$ generated by the set of prefix-transpositions $st =
\{t_{1,i} \in \mathrm{Sym}_n,\ 1 < i \leq n\}$, $|st| = n - 1$. It is one of the
most investigated graphs in the theory of interconnection
networks since many parallel algorithms are efficiently mapped
on the star Cayley graph.

Lemma 6: [8] The star Cayley graph $\mathrm{Sym}_n(st)$, $n \geq 3$, is
a connected bipartite $(n - 1)$-regular graph of order $n!$ with
diameter $\lfloor \frac{3(n-1)}{2} \rfloor$.

The star Cayley graph $\mathrm{Sym}_n(st)$ is not distance-regular
for $n \geq 4$ [6] and has no cycles of lengths 3, 4, 5 or 7.
Hence $\lambda(\mathrm{Sym}_n(st)) = 0$ and $\mu(\mathrm{Sym}_n(st)) = 1$. Moreover,
it is easy to verify that $N_4(\mathrm{Sym}_n(st), 2) = 4$ for $n \geq 5$,
$N_3(\mathrm{Sym}_n(st), 2) = 4$ for $n \geq 4$, $N_2(\mathrm{Sym}_n(st), 2) = 2(n - 1)$
for $n \geq 5$ and $N_1(\mathrm{Sym}_n(st), 2) = 2(n - 1)$ for $n \geq 4$.
From these properties and by (6) and (7) we get the following
theorem.



Theorem 7: For any $n \geq 4$ we have

$$N(\mathrm{Sym}_n(st), 1) = 2 \quad \text{and} \quad N(\mathrm{Sym}_n(st), 2) = 2(n - 1).$$

Thus, in the bubble-sort and star Cayley graphs any
unknown permutation $\pi$ is uniquely reconstructible from 3
distinct 1-neighbors of $\pi$. Similarly, for the unique reconstruction
of $\pi$ from neighbors at distance at most 2 we see that
any $2n - 1$ distinct 2-neighbors of $\pi$ are sufficient. These
two graphs are examples for which the inequality (ii) in
Corollary 1 is attained.
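The values in Theorems 6 and 7 can be checked by exhaustive search for small $n$. The Python sketch below evaluates definition (1) for the bubble-sort and star generator sets, with one ball centered at the identity and the other center ranging over all vertices within distance $2r$. Function names are illustrative, positions are 0-indexed, and the search is only feasible for small $n$.

```python
from collections import deque

def ball(center, swaps, r):
    # Breadth-first search: all permutations within r generator steps of center.
    dist = {center: 0}
    queue = deque([center])
    while queue:
        p = queue.popleft()
        if dist[p] == r:
            continue
        for i, j in swaps:
            q = list(p)
            q[i], q[j] = q[j], q[i]
            q = tuple(q)
            if q not in dist:
                dist[q] = dist[p] + 1
                queue.append(q)
    return set(dist)

def max_ball_intersection(n, swaps, r):
    # N(Gamma, r) with one center fixed at the identity (vertex-transitivity).
    e = tuple(range(n))
    ball_e = ball(e, swaps, r)
    others = ball(e, swaps, 2 * r) - {e}
    return max(len(ball_e & ball(y, swaps, r)) for y in others)

def bubble_sort_swaps(n):
    # Adjacent transpositions t_{i,i+1}.
    return [(i, i + 1) for i in range(n - 1)]

def star_swaps(n):
    # Prefix transpositions t_{1,i}, written with 0-indexed positions.
    return [(0, i) for i in range(1, n)]

for n in (4, 5):
    for name, swaps in (("bubble-sort", bubble_sort_swaps(n)), ("star", star_swaps(n))):
        n1 = max_ball_intersection(n, swaps, 1)
        n2 = max_ball_intersection(n, swaps, 2)
        print(name, n, n1, n2, 2 * (n - 1))
```

The last printed column is the value $2(n - 1)$ stated in the theorems, for comparison with the computed $N(\Gamma, 2)$.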